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1.  Introduction 

During  the  past  20  years,  progress  has  been  made  toward  putting  the  theor>^  of 
programming  on  a  firm  mathematical  basis.  McCabe  [6]  proposed  that  program 
complexity  be  measured  by  the  cyclomatic  number,  which  is  one  plus  the  number 
of  decision  nodes  in  the  program  flowchart.  Studies  have  been  done  to  determine 
how  well  such  measures  perform  in  practice  [1]  and  to  what  extent  they  predict 
the  number  of  programming  errors  [7]. 

In  a  much  cited  paper,  Dijkstra  [4]  proposed  that  the  GO  TO  statement  be 
eliminated  in  high-level  languages,  since  it  allowed  unnecessarily  complicated 
program,  structures.  It  was  suggested,  rather,  that  transfer  from,  one  part  of  the 
program  to  another  be  done  only  through  the  use  of  a  small  set  of  constructs.  One 
such  set  consists  of  the  DO- WHILE,  IF-THEN-ELSE,  and  SEQUENCE.  Flow- 


The  research  of  the  first  author  was  supported  by  the  National  Science  Foundation  under  Grant  MCS 
79-27060.  The  research  of  the  second  author  was  supported  by  a  National  Research  Council  Senior 
Postdoctoral  Associateship  while  the  second  author  was  tenured  at  the  Naval  Postgraduate  School, 
Monterey,  California. 

Authors  addresses:  E.  A.  Bender,  Department  of  Mathematics,  University  of  Califomia-San  Diego,  La 
Jolla,  CA  92093,  J.  T.  Butler,  Department  of  Electrical  Engineering  and  Computer  Science,  Northwest¬ 
ern  University,  Evanston,  IL  60201 

Permission  to  copy  without  fee  all  or  part  of  this  material  is  granted  provided  that  the  copies  are  not 
made  or  distributed  for  direct  commercial  advantage,  the  ACM  copyright  notice  and  the  title  of  the 
publication  and  its  date  appear,  and  notice  is  given  that  copying  is  by  permission  of  the  Association  for 
Computing  Machinery.  To  copy  otherwise,  or  to  republish,  requires  a  fee  and/or  specific  permission. 

©  1985  ACM  0004-541 1/85/0700-0537  $00.75 


Journal  of  the  Association  for  Computing  Machinery,  Vol.  32,  No.  3,  July  1985,  pp.  537-548. 


Report  Documentation  Page 

Form  Approved 

0MB  No.  0704-0188 

Public  reporting  burden  for  the  collection  of  information  is  estimated  to  average  1  hour  per  response,  including  the  time  for  reviewing  instructions,  searching  existing  data  sources,  gathering  and 
maintaining  the  data  needed,  and  completing  and  reviewing  the  collection  of  information.  Send  comments  regarding  this  burden  estimate  or  any  other  aspect  of  this  collection  of  information, 
including  suggestions  for  reducing  this  burden,  to  Washington  Headquarters  Services,  Directorate  for  Information  Operations  and  Reports,  1215  Jefferson  Davis  Highway,  Suite  1204,  Arlington 

VA  22202-4302.  Respondents  should  be  aware  that  notwithstanding  any  other  provision  of  law,  no  person  shall  be  subject  to  a  penalty  for  failing  to  comply  with  a  collection  of  information  if  it 
does  not  display  a  currently  valid  0MB  control  number. 

1.  REPORT  DATE 

JUL  1985  2.  REPORT  TYPE 

3.  DATES  COVERED 

4.  TITLE  AND  SUBTITLE 

Enumeration  of  Structured  Flowcharts 

5a.  CONTRACT  NUMBER 

5b.  GRANT  NUMBER 

5c.  PROGRAM  ELEMENT  NUMBER 

6.  AUTHOR(S) 

5d.  PROJECT  NUMBER 

5e.  TASK  NUMBER 

5f.  WORK  UNIT  NUMBER 

7.  PEREORMING  ORGANIZATION  NAME(S)  AND  ADDRESS(ES) 

Northwestern  University, Department  of  Electrical  Engineering  and 
Computer  Science, Evanston, IL, 60201 

8.  PEREORMING  ORGANIZATION 

REPORT  NUMBER 

9.  SPONSORING/MONITORING  AGENCY  NAME(S)  AND  ADDRESS(ES) 

10.  SPONSOR/MONITOR’S  ACRONYM(S) 

11.  SPONSOR/MONITOR’S  REPORT 
NUMBER(S) 

12.  DISTRIBUTION/AVAILABILITY  STATEMENT 

Approved  for  public  release;  distribution  unlimited. 

13.  SUPPLEMENTARY  NOTES 

14.  ABSTRACT 

An  analysis  of  structured  flowcharts  is  presented,  where  size  is  measured  by  the  number,  n,  of  decision 
nodes  (IF-THEN-ELSE  and  DO- WHILE  nodes).  For  all  classes  of  structured  flowcharts  considered,  the 
number  of  charts  is  approximately,  cn-?/*y?,  for  large  n,  where  c  and  y  are  parameters  that  depend  on  the 
class.  It  is  also  shown  that  most  large  flowcharts  consist  of  a  short  sequence  of  basic  charts 
(IF-THEN-ELSE  and  DO-WHILE  charts).  The  average  length  of  such  sequences  is  2.5. 

15.  SUBJECT  TERMS 

16.  SECURITY  CLASSIFICATION  OF:  17.  LIMITATION  OF 

18.  NUMBER  19a.  NAME  OF 

nu  PAr;FQ  pf^poniqipi  f  ppp^ont 

a.  REPORT  b.  ABSTRACT  c.  THIS  PAGE 

unclassified  unclassified  unclassified 

12 

Standard  Form  298  (Rev.  8-98) 

Prescribed  by  ANSI  Std  Z39-18 


538 


E.  A.  BENDER  AND  J.  T.  BUTLER 


charts  formed  from  these  are  called  D~charts,  after  Dijkstra.  Bohm  and  Jacopini 
[3]  showed  that  any  algorithm  can  be  implemented  as  a  D-chart. 

Knuth  [5]  and  others  have  advocated  less  restrictive  constructs  to  avoid  addi¬ 
tional  computation  sometimes  required  by  the  constraints  of  D-charts.  One  less 
restrictive  construct  is  the  DO- WHILE  with  one  or  more  midloop  exits.  Together 
with  the  IF-THEN-ELSE  and  SEQUENCE,  these  form  the  basis  of  the  -charts, 
after  Bdhm  and  Jacopini.  BJ^-charts  form  a  hierarchy.  That  is,  BJ^-charts  include 
BJ;„-i -charts,  which,  in  turn,  include  BJ;„_2-charts,  and  so  on.  At  the  lower  end  are 
the  BJi -charts,  which  are  identical  to  D-charts. 

Although  there  is  a  better  understanding  of  program  structure  and  complexity, 
the  full  range  of  the  programming  process  has  not  been  explored.  We  believe  that 
further  progress  depends  on  a  better  understanding  of  the  universe  from  which  all 
programs  are  produced.  That  is,  when  a  programmer  designs  a  program,  a  choice 
is  made  from  the  set  of  all  programs.  Thus,  there  is  the  question  of  whether  a 
programmer  produces  programs  that  correspond  to  a  random  choice  or  whether 
there  is  a  bias.  Our  results  indicate  the  latter  is  true. 

We  begin  by  enumerating  BJ^-charts.  It  is  shown  that  the  number  of  «-node 
charts  is  approximately  where  c  and  y  are  constants  near  0.9  and  9, 

respectively.  We  show  that,  in  large  D-charts,  approximately  56  percent  of  the 
decision  nodes  are  IF-THEN-ELSE,  while  the  remaining  44  percent  are  DO- 
WHILE.  In  large  charts,  where  one  or  more  midloop  exits  are  allowed  in  DO  loops, 
the  converse  is  true;  a  larger  percentage  of  nodes  is  DO- WHILE.  We  also  investigate 
the  composition  of  large  flowcharts.  That  is,  any  flowchart  corresponds  to  a 
sequence  of  basic  charts,  which  are  the  IF-THEN-ELSE  and  DO- WHILE  charts. 
It  is  shown  that  large  flowcharts,  on  the  average,  consist  of  a  small  number  (two  or 
three)  of  basic  charts.  Long  sequences  (of  more  than  eight  basic  charts)  are  very 
rare. 

2.  Preliminaries 

A  flowchart  is  a  directed  graph  where  nodes  represent  code  segments  and  arcs 
represent  transfer  of  control.  A  decision  node  is  a  node  with  indegree  1  and 
outdegree  2,  representing  a  code  segment  from  which  control  proceeds  to  one  of 
two  points,  depending  on  the  value  of  a  predicate.  Figure  la  shows  a  decision  node 
and  Figure  lb  shows  the  representation  used  in  this  paper. 

Note  that  interchanging  T  (true)  and  F  (false)  in  Figure  la  is  identical  to  replacing 
P  with  its  complement.  Since  all  possible  predicates  are  allowed,  it  is  appropriate 
to  just  omit  T  and  F,  as  is  done  in  Figure  lb.  The  other  type  of  node,  the 
reconvergent  node,  has  indegree  2  and  outdegree  1,  as  shown  in  Figure  Ic.  It 
represents  a  code  segment  to  which  flow  of  control  converges.  We  deal  in  this  paper 
exclusively  with  one-input  one-output  flowcharts.  It  follows,  therefore,  that  the 
number  of  decision  nodes  is  identical  to  the  number  6f  reconvergent  nodes. 
However,  this  paper  focuses  on  the  decision  node  and,  from  now  on,  the  term 
node  will  refer  to  a  decision  node. 

2.1.  D-Charts.  D-charts  are  built  up  from  smaller  D-charts  in  one  of  three 
ways,  as  shown  in  Figure  2.  Figure  2a  shows  the  IF-THEN-ELSE  construct.  A 
chart  of  the  form  shown  in  Figure  2a  is  called  an  IF-THEN-ELSE  flowchart,  and 
n  is  called  an  IF-THEN-ELSE  node.  Interchanging  G\  and  G2  leaves  the  chart 
unchanged,  since  it  is  the  same  as  complementing  the  predicate  at  rj.  A  flowchart 
of  the  form  in  Figure  2b  is  called  a  DO-WHILE  flowchart  and  v  is  called  a  DO- 
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A  Y 


(b) 


(c) 


Fig.  1.  Flowchart  nodes,  (a)  and  (b)  Decision 
node,  (c)  Reconvergent  node. 


Fig.  2.  The  iF-THEN-ELSE,  DO-WHILE, 
and  SEQUENCE  constructs,  (a)  IF-THEN- 
ELSE.  (b)  DO- WHILE,  (c)  SEQUENCE. 


(a)  (b)  (c) 


WHILE  node.  Figure  2c  shows  the  SEQUENCE  flowchart,  built  up  by  concaten¬ 
ating  IF-THEN-ELSE  or  DO- WHILE  flowcharts. 

Definition  1.  D-Chart: 

( 1 )  A  directed  arc  is  a  D-chart. 

(2)  For  h  >  the  SEQUENCE  of  Gi,  G2,  Gh  is  a  D-chart  where  G,  is  an 
IF-THEN-ELSE  or  DO- WHILE  chart  with  subcharts  that  are  D-charts  for 
I  <  /  <  /z. 

Let  the  sequency  of  a  D-chart  be  the  value  of  h.  It  is  convenient  to  let  the 
sequency  of  a  directed  arc  be  0.  Charts  of  sequency  1  are  precisely  the  IF-THEN- 
ELSE  and  DO- WHILE  charts.  .All  charts  with  0,  1,  and  2  decision  nodes  are  shown 
in  Figure  3.  Figure  3a  shows  a  chart  with  sequency  0,  Figure  3b  shows  charts  with 
sequency  2,  Figure  3c  single-node  charts  with  sequency  1  and  Figure  3d  double¬ 
node  charts  with  sequency  1 . 

Not  all  flowcharts  are  D-charts.  Figure  4  is  not  realizable  as  any  combination  of 
IF-THEN-ELSE,  DO- WHILE,  and  SEQUENCE  for  any  integer  7  >  1. 

2.2.  BJ^-Charts.  The  construct  shown  in  Figure  4  was  introduced  by  Bohm 
and  Jacopini  [3]  as  a  model  for  the  midloop  exit  of  a  DO  loop.  A  flowchart  of  the 
form  shown  is  called  a  DO-WHILE-WITH-j -EXITS flowchart.  ??i,  •  •  • ,  r?;  are 

called  DO-WHILE-WITH-j -EXITS  nodes.  Note  that  the  DO- WHILE  and  DO- 
WHILE- WITH- 1 -EXIT  flowcharts  are  identical. 

Definition  2.  BJm-Chart  (m  a  positive  integer  or  00): 

(1)  A  directed  arc  is  a  Bi;,, -chart. 

(2)  For  /z  >  1,  the  SEQUENCE  of  Gi,  G2, . . . ,  Gh  is  a  BJ^-chart,  where  each  G/  is 
either  an  IF-THEN-ELSE  chart  with  BJ^ -subcharts  or  a  DO- WHILE- WITH- 
7 -EXITS  chart  {j  <  m)  with  BJ,„ -subcharts. 

The  sequency  of  a  BJ;„ -chart  is  0  (respectively,  /z)  if  it  satisfies  Part  1  (respectively, 
2)  of  Definition  2. 
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(C)  (d) 

Fig.  3.  All  D-charts  with  0,  1,  and  2  decision  nodes,  (a)  Flowchart  with 
sequency  0.  (b)  Flowcharts  with  sequency  2.  (c)  Single-node  flowcharts  with 
sequency  1.  (d)  Double-node  flowcharts  with  sequency  1. 


Fig.  4.  The  DO- WHILE- WITH-y-EXITS  construct. 


3.  Flowchart  Enumeration  by  Generating  Functions 

Let  fm{n)hQ  the  number  of  BJ^-charts  with  n  decision  nodes.  For  example,  from 
Figure  3  we  have  for  BJi  =  D-charts, /I (0)  =  —  2,  and /i(2)  =  8.  Let  Fmix) 

be  the  generating  function  for  BJ^ -charts;  that  is, 

Fm{x)^fm{0)+fm{i)X+fm{2)x^+  •••  + /m(/)x'  +  .  .  . ,  (1) 

where  a:  is  a  formal  variable.  An  expression  for  Fm{x)  will  be  obtained  from 
ITE^(x)  and  DWW^(a:),  the  generating  functions  for  the  number  of  BJ^-charts 
which  are  IF-THEN-ELSE  and  DO- WHILE- WITH-m -EXITS  flowcharts,  re¬ 
spectively. 

Consider  DWW;„(a:)  first.  Distinct  choices  for  Gi,  Gj,  . . . ,  Gj  in  Figure  4  result 
in  distinct  DO- WHILE- WITH-y -EXITS  flowcharts.  Thus, 

m 

DWW„(;c)  =  E  x’FUx). 


(2) 
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Here,  x^F^rnM  corresponds  to  the  contribution  of  the  DO- WHILE- WITH-j- 
EXITS  flowcharts.  Fm{x)  is  the  contribution  from  each  of  the  subcharts,  while  x^ 
is  the  contribution  from  the  j  decision  nodes.  From  (2), 


DWW^(x)  = 


XFnr{x){\  -  X^F^{X)) 
1  -  xFm{x) 


(3) 


where  is  interpreted  as  0. 

Consider  ITE;;,(x).  If  distinct  choices  for  subcharts  Gi  and  G2  resulted  in  a 
distinct  IF-THEN-ELSE  flowchart,  then  ITE^(x)  would  be  The  gener¬ 

ating  function  for  those  charts  with  Gi  =  G2  is  xFm(x^).  Since  interchanging  G\ 
and  G2  produces  the  same  flowchart, 


Since  a  BJ^ -chart  is  a  SEQUENCE  of  some  number  i  of  IF-THEN-ELSE  and 
DO- WHILE- WITH-m-EXITS  flowcharts,  the  generating  function  for  BJ^-charts 
can  be  written  as 


Fm{x)  =  s  llTEn,ix)  +  DWW^(x)]'. 
/=0 

Thus, 

FUx)  =  [1  -  ITEM  -  DWW^(jc)]-‘. 


Substituting  (3)  and  (4)  into  (6)  yields 

/  X  r.  xFi,{x)  xFM) 


FUx)  =  1 


xFm{x) 


1  -  x^FM 
1  -  XFn,(x)  , 


(5) 

(6) 

(7) 


This  function  equation  can  be  used  to  obtain  the  coefficients  of  F^ix)  recursively, 
as  follows.  Let  Pd(x)  be  a  polynomial  of  degree  d  that  agrees  with  Fm{x)  through 
terms  of  degree  d.  Using  it  in  place  of  Fm(x)  in  the  right  side  of  (7)  and  truncating 
at  degree  d  +  1  produces  Pd-^\{x). 

Table  I  shows  the  results.  It  is  interesting  to  note  that,  for  a  fixed  «,  the  number 
of  n  node  BJ^-charts  that  are  also  BJ^-i  is  quite  substantial  for  m>  2.  However, 
as  will  be  seen  shortly,  for  a  large  enough  «,  BJ^_rcharts  represent  an  arbitrarily 
small  fraction  of  BJ^-charts. 


4.  Asymptotic  Approximations  to  the  Number  of  BJ^-Charts 

In  this  section,  we  derive  an  asymptotic  approximation  for  fm{n).  We  will  use 
Theorem  5  of  Bender  [2]: 

Theorem  1  [2,  p.  502].  Assume  the  power  series  w(x:)  =  with 

nonnegative  coefficients  satisfies  G{x,  w)^Q  in  which  w  and x  are  related  implicitly. 
Suppose  there  exist  real  numbers  r>0  and  s>  ao  such  that 

(/)  for  some  A  >  0,  G(x,  w)  is  analytic  /or  j  jc  |  <  r  +  A  and  |  w  |  <  5  +  A, 

(//)  G(r,  s)  =  G^{r,  s)  =  0, 

{Hi)  Gx{r,  5)  5^  0  and  G^^{r,  s)  7^  0,  and 

(/v)  if\x  \  <  r,  I  w  I  <5,  and  G{x,  w)  =  G,^{x,  w)  =  0,  then  x  =  r  and  w-  s. 
Then, 


a, 


(8) 


542 


E.  A.  BENDER  AND  J.  T.  BUTLER 


TABLE  1.  The  Number  f„{n)  of  BJ^-Charts  versus  the  Number  n  of  Decision  Nodes 


fi(n) 

fiin) 

Mn) 

A(n) 

m 

/4«) 

n 

D  =  BJ, 

Bh 

BJ3 

BJ4 

BJs 

BJc 

0 

1 

1 

1 

1 

1 

1 

1 

2 

2 

2 

2 

2 

2 

2 

8 

9 

9 

9 

9 

9 

3 

43 

53 

54 

54 

54 

54 

4 

258 

347 

359 

360 

360 

360 

5 

1,682 

2,463 

2,584 

2,598 

2,599 

2,599 

6 

11,529 

18,358 

19,526 

19,680 

19,696 

19,697 

7 

82,058 

141,959 

153,026 

154,602 

154,793 

154,812 

8 

600,320 

1,127,755 

1,231,851 

1,247,453 

1,249,517 

1,249,770 

9 

4,487,352 

9,150,633 

10,126,949 

10,278,871 

10,300,132 

10,303,076 

10 

34,120,281 

75,508,725 

84,658,561 

86,124,597 

86,338,009 

86,370,016 

for 


and 


c  = 


rGxjr,  s) 
lirG  ww  sy 


1/2 


7  = 


(9) 


(10) 


With  w  playing  the  role  of  Fm{x)  in  (7),  we  have, 


Gix,  w)  =  ^ 


xw^ 


+ 


X^FntiX^) 


-  2x]w^  +  [x  -\-  I  - 


Differentiating  G  with  respect  to  w  yields 

Gw{x,  w)  —  {m  +  +  2x^w^  - 


xFmjx^) 
2 

3xw^ 


w  -  1  =  0.  (11) 


+  {X^FmiX^)  -  4x)w  +  X  +  1  - 


xFm{X^) 


(12) 


Given  x  =  r  and  w  =  5,  satisfying  the  conditions  of  Theorem  1,  we  have, 


Gx{r,  s)  =  {m  +  l)r"’5'”'^^  +  rs^  -  — 


+  (r^FUr^)  +  rFnrir^)  -  2)s^  +  (  1  -  r^FUr^)  - 


Fm{r^) 


s  (13) 


and 


Gy^{r,  s)  =  {m  +  2){m  +  l)r'”'^'5"’  +  -  3rs  +  r^Fmir'^)  -  Ar.  (14) 

The  values  of  r,  s,  Fw(r^),  and  F^G^)  were  found  numerically  as  in  [2,  Example 
7.2],  resulting  in  the  asymptotic  expressions  for  Fm{n)  shown  in  Table  II.  The  error 
in  the  approximations  for  /?  =  10  ranges  from  6.1  percent  foryi(lO)  to  7.4  percent 
for /4 10). 


b 

O 


in  -H  tT 
oo  rsi 
r-  m  lo 

«  VO 


vo  O  m  00 

oo  <N  Ov  m 

r~  W)  VO  — 

«-  -<  vq  re 

O  O  O  <N 


r-  VO  r-  rs 

oo  — I  ov 

in  VO  — 
-H  VO 


Tt  GO  oo 

C3V  <5v  O 

^ 

—  —  VO 
O  O  O 


OO  m  <?\ 
fS  (N  •«^■ 
oo  Tj-  r- 
— ,  VO 


in  O  vn 
— '  m  in 
0  —  00 
<N  —  VO 
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5.  Properties  of  Large  BJm-C harts 

In  this  section,  we  calculate  how  many  nodes  there  are  of  various  types,  how  many 
flowcharts  there  are  with  various  sequences,  and  the  average  sequency  of  rt-node 
flowcharts,  where  «  c».  In  the  following  analysis,  we  consider  generating  functions 

A{x)  with  explicit  representations.  Thus,  a  special  case  of  Darboux’s  theorem 
(stated  in  [2,  Theorem  4])  will  be  useful. 

Theorem  2  [2,  p.  498].  Suppose  A{x)  -  S«=o  cinX"'  is  analytic  near  0  and  can 
be  written  in  the  form 

1  -  g(x),  (15) 

where  a  is  the  only  singularity  of A{x)  such  that  \  a\  <\  ^\for  all  other  singularities 
of  Aix).  Further,  if  g  and  h  are  analytic  near  a,  co  5«^  0,  -1,  -2, . . . ,  and  ^ 
0,  then 


A{x)  =  hix)  + 


r(co)«”  ’ 


(16) 


where  r(co)  is  the  gamma  function. 

Theorem  2  shows  that  the  asymptotic  behavior  of  coefficients  of  .x"  in  A{x)  is 
determined  by  the  singularity  of  least  magnitude. 

In  deriving  Theorem  1,  Bender  [2,  p.  505]  shows  that  >v(x)  has  a  power  series 
expansion  in  powers  of  (1  -  x/rf^^  near  x  -  r  beginning  with 


s  — 


/2r5^y/Yj  _  X 
\  Gwwi.r'i  s)  J  \  r^ 


1/2 


(17) 


This  applies  to  w  =  F„j(x)  by  the  result  of  the  previous  section.  Near  (x,  w)  = 
(r,  s),  Gw{x,  w)  has  a  power  series  expansion  beginning  with 


GUl  s)  +  Gxw(r,  s){x  -  r)  +  G^^{r,  5)(w  -  s).  (18) 


Combining  (17),  (18),  and  Gw{r,  s)  =  0,  we  see  that  Gh,(x,  w)  has  a  power  series 
expansion  in  (1  -  x/rf^^  beginning  with 


-  (2rGAr,  s)GMl  s)y'^ 


(19) 


5.1.  Average  Number  of  Nodes  of  Various  Types.  Consider  now  the 
calculation  of  the  average  number  of  nodes  of  various  types.  Let  €ite(/?)  and 
€dww(«)  be  the  fraction  of  nodes  in  /7-node  charts  that  are  type  IF-THEN-ELSE 
and  DO- WHILE-WITH-/77 -EXITS,  respectively.  Since  there  are  exactly  two  types 
of  nodes. 

«ite(«)  +  €dww(«)  =  1.  (20) 

We  solve  first  for  €nE{n)  and  then  use  (20)  to  find  €dww(«).  Let  Fm{x,  y)  = 
2  anpX^y^,  where  a„p  is  the  number  of  /z-node  flowcharts  p  of  which  are  IF-THEN- 
ELSE  nodes.  If  we  differentiate  Fm{x,  y)  with  respect  to  y,  and  set  =  1,  we  get 
Pm{x)  =  Yicini  +  2an2  +  ••.  +  nann)x”.  The  coefficient  of  x”  divided  by  the 
number  of  BJ^ -charts  with  n  nodes  is  the  average  number  of  nodes  that  are  IF- 


Enumeration  of  Structured  Flowcharts 


545 


THEN-ELSE.  Fn,{x,  y)  satisfies  the  functional  equation 
Fm{x,  y) 


xyFUx.  y)  xyF^ixf  y^)  1  - 

^  O  x^ m\X,  y) 


1  -  xF„(x, 

which  has  a  form  similar  to  (7)  except  that  x  in  the  factors  corresponding  to  IF- 
THEN-ELSE  nodes  is  replaced  by  xy.  We  have 


PUx)  = 


dFmjx,  y) 
dy 


N{x,  w) 


where 


N{x,  w)  = 


y=i  Gy,ix,  w)  ’ 

xFm{X^) 


(22) 


+  xPmix^)  VV, 


(23) 


and  G>v(x,  w)  is  given  by  (12).  By  (19)  and  (22),  the  series  expansion  of  Pm{x)  in 
powers  of  (1  —  xfr)^''^  begins  with 

-N(r,  s)(2rGAr,  s)GUr,  (*  "  f)  ' 


By  Theorem  2,  (9),  and  (24),  the  average  number  of  IF-THEN-ELSE  nodes, 
neiTEin)  satisfies 


neiTEin) 


N{r,  s)i2rGx{r,  5)Gw(r,  s))~^^^n 
{rGAr.  s)/27rGUr,  5))'/^«-'/^tT(1/2) 
Njr,  s) 
rGAr,  s) 


(25) 


Thus, 


€ITe(«) 


N(r,  s) 
rGx{r,  s) ' 


(26) 


Table  II  shows  cite(«)  and  €dww(«)  =  1  “  cite(«)  as  «  — >  oo  for  the  various 
flowcharts.  It  is  interesting  to  note  (by  comparing  D  and  BJ<»)  that  allowing  arbitrary 
midloop  exits  only  increases  the  fraction  of  DO-WHILE-type  modes  by  10  percent. 


5.2.  Number  of  IF-THEN-ELSE  and  DO- WHILE- WITH-m -EXITS 
Charts.  We  turn  now  to  the  calculation  of  flowchart  sequency.  Recall  that  the 
sequency  of  a  flowchart  /  is  the  number  of  IF-THEN-ELSE  and  DO-WHILE- 
WITH-m -EXITS  charts  concatenated  to  form  /  First  consider  charts  of  sequency 
1.  Such  charts  are  of  two  types.  The  generating  function  of  «-node  IF-THEN- 
ELSE  charts  is  given  in  (4).  Using  ( 1 7)  and  reasoning  as  before,  we  find  that  mite(  «), 
the  fraction  of  charts  that  are  type  IF-THEN-ELSE  is  given  as 


Mite(«)  ~  rs. 


(27) 


In  a  similar  fashion,  /i4dww(«),  the  fraction  of  charts  which  are  type  DO-WHILE- 
WITH-m-EXITS  can  be  derived  from  (3),  yielding, 


Mdww(«) 


(1  -  rsf 


(28) 
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Since  miteCw)  and  mdww(«)  account  for  all  «-node  flowcharts  with  sequency  1, 
the  fraction  of  charts,  /isQCY>i(«)  with  sequency  2  or  more  is  just 

MSQCY>i(w)  =  1  —  fXlTEin)  —  AlDWw(«).  (29) 

Table  II  shows  mite(«),  mdwwC^),  and  msqcy>i(«)  as  «  ^  oo.  Surprisingly,  mite/mdww 
is  not  close  to  €iteAdww* 


5.3.  Distribution  of  Flowcharts  with  Respect  to  Sequency.  We  extend 
these  results  to  count  the  flowcharts  with  various  sequencies.  Specifically,  the 
number  of  «-node  BJ;„-flowcharts  with  sequency  h  is  the  coefficient  of  x”  in 

Six)  =  [ITE^(x)  +  DWW^(x)]^ 


Let  bhm{n)  be  the  fraction  of  /t-node  BJ^ -charts  with  sequency  h.  Reasoning  as 
before. 


hmin) 


h 


rs^  rFmir^)  r5(l  -  ' 

T  2  1  -  rs 


X 


rs  -F 


mr^^2gm+i  _  +  \)r^-^^s^  +  r 

(1  -  rsf 


(30) 


Table  III  lists  the  values  of  bhmin)  for  «  ^  oo,  l  <  //  <  8,  and  m  =  1,  2,  3,  4,  5, 
and  00. 

Figure  5  shows  a  plot  of  bhm{n)  versus  h  for  two  flowchart  classes,  D  =  BJi  charts 
and  BJoo-charts.  It  can  be  seen  from  Table  III  and  Figure  5  that  the  average  sequency 
is  between  2  and  3.  These  averages  <r^(«),  can  be  calculated  by  multiplying  (30) 
by  h  and  summing  over  h.  The  results  are  shown  in  Table  II. 


6.  Concluding  Remarks 

We  approach  the  problem  of  enumerating  flowcharts  from  two  points  of  view.  In 
Section  3,  generating  functions  are  used  to  calculate  the  exact  number  of  flowcharts 
with  1  to  10  decision  nodes.  In  Section  4  we  develop  asymptotic  approximations 
to  the  number  of  BJ;„ -charts  and  find  that  it  grows  as  for  large  n,  where  c 

and  7  are  constants.  The  error  in  the  approximation  is  only  7  percent  at  «  =  10. 

We  observe  that  BJ^-i -charts  represent  a  vanishing  fraction  of  BJ^,, -charts  as 
«  00.  This  is  in  contrast  to  the  case  of  small  «  (1  <  n  <  10).  For  example,  for 

«  =  10,  BJ2-charts  represents  89  percent  of  the  set  of  BJs-charts. 

An  analysis  of  the  types  of  nodes  in  large  charts  shows,  for  example,  that  in 
D  =  BJ] -charts,  IF-THEN-ELSE  nodes  represent  about  56  percent  of  the  total, 
while  DO- WHILE  nodes  represent  the  remaining  44  percent.  It  is  interesting  to 
compare  this  with  the  fact  that  there  are  almost  twice  as  many  IF-THEN-ELSE 
charts  as  there  are  DO- WHILE  charts.  Thus,  in  a  SEQLfENCE  of  large  D-charts, 
we  would  expect  the  node  types  that  determine  the  subcharts  to  be  distributed  in 
a  different  proportion  than  the  set  of  all  nodes. 

Another  characteristic  investigated  is  sequency,  the  number  of  basic  subcharts 
in  a  SEQUENCE  that  composes  the  flowchart.  The  average  sequency  is  about  2.5. 
Thus,  long  chains  of  basic  charts  (IF-THEN-ELSE  and  DO-WHILE)  are  rare  in 
structured  flowcharts. 

We  have  observed  that  programmers  tend  to  produce  programs  with  large 
sequency.  It  would  seem  therefore  that  either 
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TABLE  III.  The  Proportion  hm{n)  of  BJ„-Charts  for  n  »  versus  Sequency  h 


Sequency  h 

h\{n) 

hiin) 

M«) 

hsin) 

M«) 

D  =  BJ, 

BJj 

BJa 

BJ4 

BJs 

BJ» 

1 

0.315 

0.325 

0.329 

0.330 

0.331 

0.331 

2 

0.276 

0.279 

0.281 

0.281 

0.281 

0.281 

3 

0.182 

0.180 

0.179 

0.179 

0.179 

0.179 

4 

0.107 

0.103 

0.102 

0.102 

0.102 

0.102 

5 

0.059 

0.055 

0.054 

0.054 

0.054 

0.054 

6 

0.031 

0.029 

0.028 

0.028 

0.028 

0.028 

7 

0.016 

0.014 

0.014 

0.014 

0.014 

0.014 

8 

0.008 

0.007 

0.007 

0.007 

0.007 

0.007 

Average  sequency  am 

2.4933 

2.4422 

2.4226 

2.4172 

2.4158 

2.4154 

Fraction 


Sequency 

Fig.  5.  Asymptotic  approximations  to  bh\{n)  and  hd.n),  the  fraction  of  n- 
node  flowcharts  with  sequency  h,  for  D  =  BJi-charts  and  for  BJ«-charts, 

respectively;  D  =  BJi-charts, - ;  BJ«,-charts, - .  Note  that  mite,  mdww, 

and  5a=o  are  the  values  to  which  mite(«),  mdww(«),  h\{n),  and  are 

asymptotically  constant. 


(a)  the  programs  we  are  solving  are  inherently  sequential,  or 

(b)  IF-THEN-ELSE  and  DO-WHILE  constructs  are  not  enough  to  help  us 
overcome  our  sequential  organizational  tendencies. 

A  better  understanding  of  these  issues  may  have  important  consequences  for  the 
design  of  programming  languages  and  our  abilities  to  handle  parallel  processing. 
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